Perceptual Aliasing in JCMB (or, Where on Earth is IPAB?)
Author
Abstract
In the real world we usually have to rely on what we can observe about our environment to judge our current state. Unfortunately our observations are inherently limited, and this can sometimes cause us to become confused and disorientated, losing track of exactly which state we are in. This confusion is called perceptual aliasing, and it occurs when our observations are not descriptive enough to identify our state uniquely. When artificial agents encounter perceptual aliasing, it can seriously damage the agent's ability to learn solutions to a problem, and this project looks at a method of overcoming it whilst learning with reinforcement learning algorithms. The general method considered is called active perception: the agent is given control over its own sensors, so that when it encounters an aliased state it can attempt to adjust them to gain new information that resolves the ambiguity. In this project a particular type of active perception known as perceptual actions is discussed, and results are presented that confirm it can help alleviate the effects of perceptual aliasing. There are two minor variations of the perceptual action approach, and both are analysed to compare their performance. Additionally, a new grid-world problem for reinforcement learning agents, based upon Edinburgh University's James Clerk Maxwell Building, is introduced. This problem is designed to suffer from perceptual aliasing and is used as an additional test problem for the algorithms studied. Finally, some brief but promising results are given for a new adaptation of the above technique, in which the agent learns in multiple observation spaces simultaneously.
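The idea of perceptual aliasing, and of resolving it with extra sensor information, can be illustrated with a minimal sketch. The corridor world, the sensor functions and every name below are hypothetical and chosen only for illustration; this is not the JCMB grid world or the perceptual action algorithm used in the project.

```python
from collections import defaultdict

# Hypothetical 1-D corridor with cells 0..4 and a wall at each end.
N_CELLS = 5


def basic_observation(state):
    """Default sensor: reports only (wall_left, wall_right) for the cell."""
    return (state == 0, state == N_CELLS - 1)


def extended_observation(state):
    """Basic reading plus an assumed long-range sensor that returns the
    distance to the left wall, standing in for a perceptual action."""
    return basic_observation(state) + (state,)


def aliased_groups(observe):
    """Group states by observation and return the groups that contain
    more than one state, i.e. the perceptually aliased states."""
    groups = defaultdict(list)
    for s in range(N_CELLS):
        groups[observe(s)].append(s)
    return [states for states in groups.values() if len(states) > 1]


print(aliased_groups(basic_observation))     # [[1, 2, 3]]
print(aliased_groups(extended_observation))  # []
```

With the basic sensor the three interior cells produce the same observation and cannot be told apart; once the extra reading is available, every cell has a unique observation.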
Similar resources
Attenuation of spatial aliasing in CMP domain by non-linear interpolation of seismic data along local slopes
Spatial aliasing is an unwanted side effect that produces artifacts during seismic data processing, imaging and interpolation. It is usually caused by insufficient spatial sampling of seismic data and often occurs in CMP (Common Mid-Point) gathers. To tackle this artifact, several techniques have been developed in the time-space domain as well as in frequency domains such as frequency-wavenumber, frequen...
Unsupervised Classification of Sensory-motor States in a Real World Artifact Using a Temporal Kohonen Map
Classification is a fundamental act of cognition. It underlies nearly all learning, transfer of learning, generalization and abstraction. One of the well-known hard and fundamental problems is the one of perceptual aliasing, i.e. that the sensory stimulation caused by one and the same object varies enormously depending on the distance from the object, orientation, lighting conditions, etc. In th...
The Impact of Perceptual Aliasing on Exploration and Learning in a Dynamic Decision Making Task
Perceptual aliasing arises in situations where multiple, distinct states of the world give rise to the same percept. In this study, we examine how the degree of perceptual aliasing in a task impacts the ability of human agents to learn reward-maximizing decision strategies. Previous work has shown that the presence of perceptual cues that help signal distinct states of the environment can impro...
Classification as Sensory-Motor Coordination: A Case Study on Autonomous Agents
In psychology classification is studied as a separate cognitive capacity. In the field of autonomous agents the robots are equipped with perceptual mechanisms for classifying objects in the environment, either by preprogramming or by some sort of learning mechanism. One of the well-known hard and fundamental problems is the one of perceptual aliasing, i.e. that the sensory stimulation caused by ...
A Study of an Indirect Reward on Multi-agent Environments
In multi-agent learning, where multiple agents learn at the same time, there is the problem of the indirect reward: how to distribute a reward to an agent that does not obtain a reward directly. We have shown the theorem [3] about the "negative effect" of an indirect reward. This paper focuses on the "positive effect" of an indirect reward, such as the elimination of the perceptual aliasing problem [1]....